Query-by-Example using Speaker Content Graphs

Authors

  • William M. Campbell
  • Elliot Singer
Abstract

We describe methods for constructing and using content graphs for query-by-example speaker recognition tasks within a large speech corpus. This goal is achieved as follows: First, we describe an algorithm for constructing speaker content graphs, where nodes represent speech signals and edges represent speaker similarity. Speech signal similarity can be based on any standard vector-based speaker comparison method, and the content graph can be constructed using an efficient incremental method for streaming data. Second, we apply random walk methods to the content graph to find matching examples to an unlabeled query set of speech signals. The content-graph based method is contrasted to a more traditional approach that uses supervised training and stack detectors. Performance is compared in terms of information retrieval measures and computational complexity. The new content-graph based method is shown to provide a promising low-complexity scalable alternative to standard speaker recognition methods.
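The two steps named in the abstract (building a content graph, then running a random walk from the query nodes) can be illustrated with a minimal sketch. The abstract does not give the details, so everything specific here is an assumption rather than the authors' method: cosine similarity as the vector comparison, a symmetrized k-nearest-neighbor graph, a random walk with restart, and the hypothetical names build_knn_graph, random_walk_scores, k, and restart. The graph is built in one batch, not with the incremental streaming method the abstract mentions.

```python
import numpy as np


def build_knn_graph(vectors, k):
    """Symmetric weighted adjacency matrix from cosine similarity (assumed metric)."""
    unit = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    sim = unit @ unit.T
    np.fill_diagonal(sim, -np.inf)  # a node is never its own neighbor
    n = sim.shape[0]
    adj = np.zeros((n, n))
    for i in range(n):
        nbrs = np.argsort(sim[i])[-k:]  # indices of the k most similar nodes
        adj[i, nbrs] = np.maximum(sim[i, nbrs], 0.0)  # keep weights non-negative
    return np.maximum(adj, adj.T)  # keep an edge if either endpoint selected it


def random_walk_scores(adj, query_nodes, restart=0.15, iters=100):
    """Random walk with restart; the walker restarts uniformly on the query nodes."""
    n = adj.shape[0]
    row_sums = adj.sum(axis=1, keepdims=True)
    row_sums[row_sums == 0] = 1.0  # isolated nodes: avoid division by zero
    transition = adj / row_sums  # row-stochastic transition matrix
    start = np.zeros(n)
    start[query_nodes] = 1.0 / len(query_nodes)
    p = start.copy()
    for _ in range(iters):
        p = restart * start + (1.0 - restart) * (transition.T @ p)
    return p


# Toy data: rows 0-3 stand in for one speaker, rows 4-7 for another.
vectors = np.array([
    [1.0, 0.1, 0.0], [1.0, 0.0, 0.1], [0.9, 0.1, 0.1], [1.0, 0.05, 0.05],
    [0.0, 1.0, 0.1], [0.1, 1.0, 0.0], [0.1, 0.9, 0.1], [0.05, 1.0, 0.05],
])
adj = build_knn_graph(vectors, k=2)
scores = random_walk_scores(adj, query_nodes=[0])
ranking = np.argsort(-scores)  # most relevant nodes first
```

In this toy setup the nodes that share a cluster with the query receive all of the walk's probability mass, so sorting by score returns them first. Real speaker vectors would be noisier, and the choice of k and restart would need tuning.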

Similar resources

Diarization-Based Speaker Retrieval for Broadcast Television Archives

In this study we extend a query-by-example diarization-based speaker retrieval system to a full speaker retrieval system for broadcast television. The envisioned system is capable of finding all speakers in an archive using their names instead of example speech fragments. Information extracted from a television guide is used to label speaker clusters that most likely correspond to the found name...

Query by Example of Speaker Audio Signals using Power Spectrum and MFCCs

Search engine is the popular term for an information retrieval (IR) system. Typically, a search engine is based on full-text indexing. Moving from text data to multimedia data types, such as the retrieval of images or sounds in large databases, makes the information retrieval process more complex. This paper introduces the use of language and text independent speech as input q...

Content-Based Retrieval of Medical Images

We consider the requirements for the design and implementation of Image DataBase (IDB) systems which support the retrieval of medical images by content. Attention is focused on a methodology for the efficient representation and retrieval of medical images based on spatial information. The content of medical images is represented by Attributed Relational Graphs (ARGs) holding features of objects...

SpeeD @ MediaEval 2015: Multilingual Phone Recognition Approach to Query by Example STD

In this paper, we attempt to solve the Spoken Term Detection (STD) problem for under-resourced languages by a phone recognition approach within the Automatic Speech Recognition (ASR) paradigm, with multilingual acoustic models from six languages (Albanian, Czech, English, Hungarian, Romanian and Russian). The Power Normalized Cepstral Coefficients (PNCC) features are used for improved robustnes...

Smart Query Definition for Content-Based Search in Large Sets of Graphs

Graphs are used in various application areas such as chemical, social or shareholder network analysis. Finding relevant graphs in large graph databases is thereby an important problem. Such search starts with the definition of the query object. Defining the query graph quickly and effectively so that it matches meaningful data in the database is difficult. In this paper, we introduce a system, ...

Publication date: 2012